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DETAILED ACTION 
Response to Amendment 

1 . Applicant has submitted an amendment filed 12/7/2005, amending independent 
claims 1 and 7-8 (referring to claims), while arguing to traverse prior art rejection based 
on the amended limitation. Applicant's arguments have been fully considered. 
However, upon further consideration, a new ground(s) of rejection is made in view of 
Partovi et al. (US 6807574). 

Claim Rejections - 35 USC § 103 

2. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 

obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 1 02 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

3. Claims 1, 3, 5-8, and 11 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Junqua et al. (US 6415257) in view of Partovi et al. (US 6807574). 

4. Regarding claims 1 and 7, Junqua et al. disclose a dialog system and a method 
of operating a dialog system (figure 3) comprising processing units for automatic speech 
recognition (12 of figure 1), natural language understanding (24 of figure 1), defining 
system outputs in dependence on information derived from user inputs (col. 2, In. 28- 
31), generating acoustic and/or visual system outputs (col. 10 In. 65 to col. 1 1 , In. 3 
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and/or element 36 of figure 1), deriving user models (col. 2, In. 36-42, input speech 
signal is processed and parameterized for used in the speech recognition process), 
from determined details about the style of speech of user inputs and/or details about 
interactions in dialogs between users and the dialog system and adaptation of contents 
and/or form of system outputs in dependence on the user models (col. 2, In. 54 to col. 3, 
In. 26 or referring to figure 2, the user's profile includes a log that keeps track of user's 
view preferences and user's speech patterns). 

Junqua et al. fail to specifically disclose wherein the style of speech is 
determined based on factor selected from the group consisting of: the number of polite 
phrases used, address used, speech level, information density, vocabulary and use of 
foreign words, number of different words and classification of words of speech inputs 
with respect to rare occurrence. However, Partovi et al. teach wherein the style of 
speech is determined based on speech level (coL 12, lines 36-55, "southern dialect" 
reads on speech level as defined in paragraph 18 of the application) and classification 
of words of speech inputs with respect to rare occurrence (coL 13, lines 36-52, "San 
Francisco" are rare occurrence words). 

Since Junqua et al. and Partovi et al. are analogous art because they are from 
the same field of endeavors, it would have been obvious to one of ordinary skill in the 
art at the time of invention to modify Junqua et al. by incorporating the teaching of 
Partovi et al. in order to improve speech recognition accuracy. 
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5. Regarding claim 8, Junqua et al. disclose a process for television-user dialog, 
comprising the steps of: receiving user speech input (element 10 in figure 1)\ processing 
the speech input using automatic speech recognition and natural language 
understanding (elements 12 and 24 in figure 1)\ and defining at least one system output 
based on the speech input and a user model derived from details of the user style of 
speech inputs (coL 2, lines 54 to col. 3, line 67, speech/speaker adaptation). 

Junqua et al. fail to specifically disclose wherein the style of speech is 
determined based on factor selected from the group consisting of: the number of polite 
phrases used, address used, speech level, information density, vocabulary and use of 
foreign words, number of different words and classification of words of speech inputs 
with respect to rare occurrence. However, Partovi et al. teach wherein the style of 
speech is determined based on speech level (coL 12, lines 36-55, "southern dialect" 
reads on speech level as defined in paragraph 18 of the application) and classification 
of words of speech inputs with respect to rare occurrence (col. 13, lines 36-52, "San 
Francisco" are rare occurrence words). 

Since Junqua et al. and Partovi et al. are analogous art because they are from 
the same field of endeavors, it would have been obvious to one of ordinary skill in the 
art at the time of invention to modify Junqua et al. by incorporating the teaching of 
Partovi et al. in order to improve speech recognition accuracy. 

6. Regarding claim 3, Junqua et al. further disclose a dialog system characterized in 
that the user models contain estimates for the reliability of recognition results derived 
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from user inputs (col. 7, In. 1-32, the score associated with each candidate represents 
the reliability of each recognized candidate). 

7. Regarding claim 5, Junqua et al. further disclose a dialog system characterized in 
that fixed models of user stereotypes are used for forming the user models (col. 8, In. 8- 
26, a speaker adaptation process). 

8. Regarding claim 6, Junqua et al. further disclose a dialog system characterized in 
that user models are used which are continuously updated based on inputs of the 
respective user (col. 3, In. 1-27, the system includes a usage log recording user's 
everyday uses of the system). 

9. Regarding claim 1 1 , Junqua et al. further disclose the process of Claim 8, 
wherein the step of defining comprises the step of: defining at least one system output 
based on the speech input and a user model which includes a familiarity level, wherein 
the system output is based on the familiarity level {col. 3, lines 1-25, familiarity level is 
determined by how often and/or how long the user has used the system and that is 
specified in the usage log). 

10. Claims 2, 4, and 9-10 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Junqua et al. (US 6415257) in view of Partovi et al. (US 6807574), as applied to 
claims 1 and 8, and further in view of Larsen (IEEE Publication). 
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1 1 . Regarding claim 2, Junqua et al. further disclose a dialog system characterized in 
that in addition to the input modality to use user inputs by means of speech, at least a 
further input modality is provided (col. 3 f In. 35-44). Junqua et al. do not disclose a 
dialog system characterized in that the user models contain details about the respective 
use of the various input modalities by the user. 

However, Larsen teaches a bi-modal application used in a dialog system, where 
a DTMF input mode is used if repeated recognition errors occur in the speech 
recognition mode (referring to APPLICATION SECTION on pages 66-67). The 
advantage of using the teaching of Larsen in Junqua et al. is to enable the system to 
take appropriate actions to process the input signal to achieve high accuracy. 

Since Junqua et al. and Larsen are analogous art because they are from the 
same field of endeavors it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Junqua et al. by incorporating the teaching of Larsen in 
order to enable the system to take appropriate actions to process the input signal to 
achieve high accuracy. 

The modified Junqua et al. still fail to disclose a dialog system characterized in 
that the user models contain details about the respective use of the various input 
modalities by the user. However, it would have been obvious to one of ordinary skill in 
the art at the time of invention to readily realize that both DTMF and speech input 
modes, as taught by Larsen, are different and both are represented by two distinct 
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signals. Therefore, the system would have distinguished and processed these two 
signals differently in order to enhance the system's efficiency and reliability. 

12. Regarding claim 4, Junqua et al. do not disclose a dialog system characterized in 
that in dependence on the estimates, system responses are generated which prompt 
the respective user to use such input modalities for which high estimate values were 
determined and/or which prevent the respective user from using input modalities for 
which low reliability values were determined. 

However, Larsen teaches a dialog system characterized in that in dependence 
on the estimates, system responses are generated which prompt the respective user to 
use such input modalities for which high estimate values were determined and/or which 
prevent the respective user from using input modalities for which low reliability values 
were determined (referring to APPLICATION SECTION on pages 66-67). The 
advantage of using the teaching of Larsen in the modified Junqua et al. is to allow the 
system to switch to a different input mode to achieve high recognition accuracy. 

Since the modified Junqua et al. and Larsen are analogous art because they are 
from the same field of endeavors, it would have been obvious to one of ordinary skill in 
the art at the time of invention to further modify Junqua et al. by incorporating the 
teaching of Larsen in order to allow the system to switch to a different input mode to 
achieve high recognition accuracy. 



Application/Control Number: 09/954,657 Page 8 

Art Unit: 2655 

13. Regarding claim 9, Junqua et al. further disclose the process of Claim 8, wherein 
the step of defining comprises the step of: defining at least one system output based on 
the speech input and a user model which includes an experience level (col. 3, lines 1- 
25, experience level is determined how often and how long the user has used the 
system and that is specified in the usage log). 

Junqua et al. fail to specifically disclose that the system output is based on the 
experience level of the user model in that if the experience level is low, the system 
output is a first length, while if the experience level is high, the system output is a 
second length lesser than the first length. However, Larsen teaches the step of 
advising the user which input modalities to use according to their experience level 
{Application Section on pages 66-67, it is inherent that the advise phrase outputted by 
the system for the experience user is shorter than for inexperience user, (e.g. "do" and 
"do not")). 

Since Junqua et al. and Larsen are analogous art because they are from the 
same field of endeavors, it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Junqua et al. by incorporating the teaching of Larsen in 
order to allow the system to switch to a different input mode to achieve effective 
communication between the user and the system. 

14. Regarding claim 10, Junqua et al. further teach the process of Claim 8, wherein 
the step of defining comprises the step of: defining at least one system output based on 
the speech input and a user model, wherein the system output is based on the likely 
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input modality (col. 3, lines 1-67). Junqua et al. fail to specifically disclose a user model, 
which includes a likely input modality for a current prompt. However, Larsen teaches a 
user model, which includes a likely input modality for a current prompt (referring to 
APPLICATION SECTION on pages 66-67). 

Since Junqua et al. and Larsen are analogous art because they are from the 
same field of endeavors it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Junqua et al. by incorporating the teaching of Larsen in 
order to enable the system to take appropriate actions to process the input signal to 
achieve high accuracy. 

15. Claim 12 is rejected under 35 U.S.C. 103(a) as being unpatentable over Junqua 
et al. (US 6415257) in view of Partovi et al. (US 6807574), as applied to claim 8, and 
further in view of Toyama et al. (US 6502082). 

16. Regarding claim 12, Junqua et al. fails to specifically disclose the process of 
claim 8 further comprising the steps of: receiving a user face image: and determining a 
degree of despair based on the user face image (col. 7, lines 38-54)] wherein the step 
of defining comprises the step of: defining at least one system output based on the 
degree of despair (co/. 1, lines 38-54). However, Toyama et al. teach the steps of: 
receiving a user face image: and determining a degree of despair based on the user 
face image (col. 1, lines 38-54)] wherein the step of defining comprises the step of: 
defining at least one system output based on the degree of despair (col. 1, lines 38-54). 
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Since Junqua et al. and Toyama et al. are analogous art because they are from 
the same field of endeavors it would have been obvious to one of ordinary skill in the art 
at the time of invention to modify Junqua et al. by incorporating the teaching of Toyama 
et al. in order to specify the system to provide appropriate services for the user. 



Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Huyen X. Vo whose telephone number is 571-272-7631 . 
The examiner can normally be reached on M-F, 9-5:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Wayne Young can be reached on 571-272-7582. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). \ / — \ / 



Conclusion 



HXV 



WAYNE YOUNG 
SUPERVISORY PATENT EXAMINEI 




1/23/2006 



*** 



